PyDigger - unearthing stuff about Python

Found 2 out of 213,014. Showing 2 on page 1. Total pages: 1.

Name	Version	Summary	date
shtec-rlhf	0.0.3.dev0	shtec-rlhf: Safe Reinforcement Learning from Human Feedback	2024-05-20 15:34:00
trl	0.8.4	Train transformer language models with reinforcement learning.	2024-04-17 15:16:50

Found 2 out of 213,014. Showing 2 on page 1. Total pages: 1.